Computer and Modernization, 2012, Vol. 1, Issue (11): 171-176. doi: 10.3969/j.issn.1006-2475.2012.11.042

• Network and Communication •

Research and Implementation of Distributed Full-text Retrieval System Based on Solr

LI Dai-wei, LI Ning   

  1. Department of Information Technology and Application System, North China Institute of Computing Technology, Beijing 100083, China
  • Received: 2012-07-13 Revised: 1900-01-01 Online: 2012-11-10 Published: 2012-11-10

Abstract: With the rapid growth of network information resources, traditional retrieval systems struggle to provide efficient and reliable service over massive data. To address this, the paper designs a distributed full-text retrieval system based on Solr. The system uses a Web crawler to collect information, which is stored as text files. The Solr index module then builds indexes in parallel on multiple computers, which effectively improves indexing speed. The search module improves retrieval performance through Zookeeper management and a distributed design. Finally, a user-friendly interface is designed. The system currently runs stably on data at the scale of millions of records and has strong practical value.

Key words: full-text search, Solr, distribution, Zookeeper
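
The pipeline summarized in the abstract (partition the collected text, build indexes in parallel, then query every partition and merge the results) can be illustrated with a minimal sketch. This is not the paper's implementation and it does not use the Solr or Zookeeper APIs: it assumes a toy in-memory inverted index, uses a thread pool as a stand-in for multiple computers, scores by raw term frequency, and omits cluster coordination entirely. All function names (`shard_of`, `build_indexes`, `search`, and so on) are hypothetical.

```python
from collections import Counter, defaultdict
from concurrent.futures import ThreadPoolExecutor
import re
import zlib

TOKEN = re.compile(r"\w+")


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return TOKEN.findall(text.lower())


def shard_of(doc_id, num_shards):
    """Route a document to a shard with a stable hash (assumed routing rule)."""
    return zlib.crc32(doc_id.encode("utf-8")) % num_shards


def build_shard_index(docs):
    """Build one shard's inverted index: term -> {doc_id: term frequency}."""
    index = defaultdict(dict)
    for doc_id, text in docs:
        for term, tf in Counter(tokenize(text)).items():
            index[term][doc_id] = tf
    return dict(index)


def build_indexes(corpus, num_shards):
    """Partition the corpus and index every partition in parallel."""
    partitions = [[] for _ in range(num_shards)]
    for doc_id, text in corpus.items():
        partitions[shard_of(doc_id, num_shards)].append((doc_id, text))
    # Threads stand in for the separate index machines of a real deployment.
    with ThreadPoolExecutor(max_workers=num_shards) as pool:
        return list(pool.map(build_shard_index, partitions))


def search_shard(index, terms):
    """Score documents in one shard by summed term frequency."""
    scores = Counter()
    for term in terms:
        for doc_id, tf in index.get(term, {}).items():
            scores[doc_id] += tf
    return scores


def search(indexes, query, top_k=10):
    """Send the query to every shard, then merge and rank the partial results."""
    terms = tokenize(query)
    with ThreadPoolExecutor(max_workers=len(indexes)) as pool:
        partials = list(pool.map(lambda idx: search_shard(idx, terms), indexes))
    merged = Counter()
    for partial in partials:
        merged.update(partial)
    # Highest score first; ties broken by document id for a stable order.
    return sorted(merged.items(), key=lambda kv: (-kv[1], kv[0]))[:top_k]
```

A caller would pass a dictionary of document ids to text into `build_indexes(corpus, num_shards)` and hand the returned list of shard indexes to `search(indexes, "solr")`. Because each document lives in exactly one shard, merging the per-shard scores is a plain union; a real deployment would additionally need the node management that the paper delegates to Zookeeper.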
